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It was known that cooperation of evolutionary prisoner's dilemma games fails to emerge in homogenous 
networks such as random graphs. Here we proposed a quantum prisoner's dilemma game. The game 
consists of two players, in which each player has three choices of strategy: cooperator (Q, defector (D) and 
super cooperator (denoted by Q). We found that quantum entanglement guarantees emergence of a new 
cooperation, the super cooperation of the quantum prisoner's dilemma games, and that entanglement is the 
mechanism of guaranteed emergence of cooperation of evolutionary prisoner's dilemma games on 

networks. We showed that for a game with temptation b, there exists a threshold arccos for a 
measurement of entanglement, beyond which, (super) cooperation of evolutionary quantum prisoner's 
dilemma games is guaranteed to quickly emerge, giving rise to stochastic convergence of the cooperations, 

that if the entanglement degree y is less than the threshold arccos then the equilibrium frequency of 
cooperations of the games is positively correlated to the entanglement degree y, and that if y is less than 

arccos ^ and b is beyond some boundary, then the equilibrium frequency of cooperations of the games on 
random graphs decreases as the average degree of the graphs increases. 

I t has been a longstanding challenge to understand the emergence and convergence (or guaranteed emergence) 
I of cooperations of games on networks, and the principles behind them. Our quantum prisoner's dilemma 
I game extends the classical prisoner's dilemma game naturally by introducing a super cooperator, and by using a 
measurement of entanglement, which we called entanglement degree as a metaphor for the relationship between 
players. This gives rise to a natural generation of the prisoner's dilemma game. Our game better reflects real games 
in nature and society. Our results explore a threshold theory for the role of quantum entanglement in emergence 
and convergence of cooperations in evolutionary quantum prisoner's dilemma games on networks, providing a 
foundation for exploring the emergence and convergence, and the principles behind, of cooperations of prisoner's 
dilemma games on complex networks. Equally important, we notice that our quantum game can be played by 
classic devices. This provides an example that quantum entanglement plays an essential role in games, and that 
the advantage of quantum games can be fully achieved by classic devices. 

The prisoner's dilemma (PD, for short) game is one of the well-known games, having implications in a wide 
range of disciplines. In a PD game, two players simultaneously decide their strategy C (cooperator) or D 
(defector). For mutual cooperation, both players receive a payoff R and receive P upon mutual defection. If 
one cooperates and the other defects, then the cooperator gains payoff S and the traitor gains temptation T. The 
payoff rank for the PD game is given by T > K > P £ S. In a PD game, the best strategy for both players is to defect 
regardless of the other's decision, in which case, the payoffs of the players are minimized. 

Real games are played in a system characterized by a graph in which the nodes are players, and the edges 
represent the games played by the two endpoints of each of the edges. The games in a graph are evolving by 
rounds. In this case, emergence of cooperation of the evolutionary games on a graph implies a maximal global 
payoff evolved from minimal local payoffs of the graphs. It plays an essential role in the organizations of the 
graphs such as systems of biosphere and human society. 

The reasons why does cooperation emerge in evolutionary games on a system could be many, one of which 
seems the special structures of the system. This is indeed the case. We know that social structures do play essential 
roles in emergence of cooperation, referred to 1 3 . Nowak and May found that cooperation emerges in a very 
simple spatial structure 4 . Santos and Pacheco showed that the scale-free networks promote emergence of coop- 
eration, with higher cooperation ratios than that on random graphs for both PD games and the snowdrift games 
(SG) 5 . All these results showed that heterogeneity does promote emergence of cooperation in different games. 
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This property has also been found in other games, for instance, in the 
public goods games (PGG) 6 , which further demonstrates that het- 
erogeneity of networks, a metaphor for social diversity, favors the 
emergence of cooperative behavior. On the other hand, cooperation 
is unlikely to emerge in the evolutionary prisoner's dilemma games 
on homogeneous networks, referred to 7 9 . 

This means that network structure is essential to emergence of 
cooperation of the PD games. However, in real world, we are usually 
unable to change the network structures. A fundamental question is 
thus: How can we guarantee emergence of cooperations of the pris- 
oner's dilemma games on general (probably random) graphs, includ- 
ing both heterogenous and homogeneous networks? 

In the present paper, we will answer this question. For simplicity, 
we choose the parameters of the PD game such that R = 1, P = S = 0, 
and T = b for b with 1 < b < 2. 

Before answering the question, we reexamine the prisoner's 
dilemma game to open the window for a new form of cooperations. 
The new possibility is that in classic prisoner's dilemma game, both 
players are independent of each other, and each of them is absolutely 
selfish and perfectly rational. However, in real world games, the two 
players in a game may always have some entangled relationship, 
which is the way that quantum comes into the game naturally. 

Quantum Prisoner's Dilemma Game 

Eisert et al in 1999 10 proposed a quantum game in which each player 
chooses a strategy from a unitary operator: 

* ,. m / cos 0/2 sinO/2 \ 
\ -sin0/2 e-'<*cos0/2 J 

71 

for <j> and 0 ranging from 0 to — and 71 respectively. 

Li et al 11 ' 13 implemented some experiments on evolutionary 
quantum games on some networks, in which each player randomly 
chooses a strategy from the whole unitary space. The experiments 
showed that quantum strategies promote cooperations in the games 
for some choices of the strategies. In particular, for two special 
quantum strategies, the Hadamard operation and the Q strategy 
(the same as our super cooperator), the experiments in 12 showed that 
the grid network is most easily invaded by quantum strategies and 
that a scale-free network can be invaded by agents adopting quantum 
strategies only if a hub of the network is occupied by an agent with a 
quantum strategy or if the fraction of agents with quantum strategies 
in the population is significantly large. Pawela and Sladkowski 14 
investigated the evolution of strategies on hyper-graph networks 
with three quantum strategies. The experiments in 14 showed that 
in the case of a three player game on a hyper-graph networks, 
quantum strategies are not necessarily stable strategies, and that in 
some cases, the defection strategy is as good as a quantum strategy. 
Chappell, Iqbal and Abbott 15 proposed an analysis of three-player 
quantum games in an EPR type setup by using the Clifford geometric 
algebra. 

Here we propose a quantum prisoner's dilemma (QPD, for short) 
game. Our idea is to introduce a special quantum strategy, written by 
Q, which joins the classical prisoner's dilemma game as a super 
cooperator. Therefore, our game consists of three strategies: C, D 
and Q, given by C= f)(0, 0),D=U(n, 0), and Q=U(0, n). 

Our game consists of two players, Alice and Bob. We assume that 

Alice and Bob have an entanglement degree y e ^0, — j . 

Then the payoff matrix of our quantum prisoner's dilemma game 
is thus given by Table 1. 

In our game, we choose just one quantum strategy, Q = U(0,n), 

71 

and allow quantum entanglement degree y to change from 0 to — . We 

notice that the payoff matrix is deduced from quantum mechanics, 
but the game itself can be simulated classically. By this reason, our 



quantum prisoner's dilemma is a game based on quantum mechanics 
that can be played by both quantum and classic devices. More impor- 
tantly, as we will see, that this generalization solves many problems of 
the prisoner's dilemma games. All the games in the experiments of 
our paper are played classically. 

From Table 1, we have the following results: 1) If y = 0, then the Q 
strategy collapses at C; 2) If both players choose the strategy Q, then 
they will get payoff 1, the same as that they both choose the strategy 

71 

C, and Q never takes advantage from C; 3) If y = — , then a (Q, C) 

71 

game gets payoffs (0,0) respectively, and Sty = — , then a (Q, D) game 

gets payoffs b and 0 respectively, meaning that Q takes advantage 
from D, and if y > 0, then Q always reduces the gain of a D strategy; 
and 4) when y increases, (Q, Q) becomes more and more a stable 
game than the (C, C) game due to the entanglement of the two 

71 

players, and if y = — , then (Q, Q) is a Nash equilibrium that max- 
imizes the payoffs. 

By 1), the classical prisoner's dilemma game is a special case of our 

71 

game. By 2), we can interpret Q as a cooperator. By 4), if y = — , then 

Q is completely different from C, meaning that Q is indeed a new 
strategy. By 3), Q is against D. By 4), the game (Q, Q) would be a Nash 
Equilibrium, and simultaneously be Pareto optimal. Therefore, our 
game is a natural generalization of the prisoner's dilemma with a new 
strategy, the super cooperator Q. 

We explain that our Q- strategy and quantum prisoner's dilemma 
game capture new phenomena in nature and society. In biological 
species evolution, initially we have a population C, later on, a species 
of invaders D appears, the invaders take advantage from the popu- 
lation C. In this case, there will be a new type mutated from popu- 
lation C, which is the Q-strategy. The new type Q is consistent with C, 
but against the invaders D. Eventually, the new type Q will evolve as a 
new dominating species. In human society, we interpret the honest 
crowds as C, and dishonest crowds as D. To prevent the invasions of 
the C-strategy crowds from the D-strategy invaders, a party Q nat- 
urally and gradually grows up from C to restrain D so that D becomes 
a minority in the society. Our quantum prisoner's dilemma game 
may explain some new phenomena in economics. For instance, it 
explains the reason why a win-win strategy in economics is possible 
and provides a mechanism to guarantee such a win-win strategy in 
economical activities. A remarkable characteristic of our quantum 
prisoner's dilemma game is that the quantum game is completely 
different from the classic prisoner's dilemma game, and that our 
quantum prisoner's dilemma game can be played by classic devices. 
This provides a remarkable example to show that classic devices fully 
realize our quantum prisoner's dilemma game, which is essentially 
different from the classic prisoner's dilemma game. This means that 
some theoretical advantage of quantum mechanics can be fully 
achieved by classic devices, leading to potential new physics. 

Evolutionary Quantum Prisoner's Dilemma Games 

Evolutions of our quantum prisoner's dilemma games on networks 
are defined similarly to that of the classical prisoner's dilemma 
games. 

Let G = ( V, E) be a graph. The evolutions of our games proceed by 
stages: At stage 0, for every node v £ V, define the strategy of v, 



Table 1 The expectation payoff matrix of our quantum prisoner's 
dilemma game 


C 


D 


Q 


C (1,1) 
D [b, 0) 

Q (cos 2 y, cos 2 y) 


(o,fa) 

(0, 0) 
[b ■ sin 2 y, b ■ cos 2 y) 


(cos 2 y, cos 2 y) 
[b • cos 2 y, b • sin 2 y) 
(1,1) 



SCIENTIFIC REPORTS | 4:6286 | DOI: 10.1038/srep06286 



2 



written by s( v) [0] , to be one of the C, D and Q with probabilities pi, p 2 

andp 3 respectively. (In our experiments, we setp, = - for each i = 1, 

2, 3.) Suppose that the strategy of v at stage f, written s(v)[f], are 
defined for all v e V. Then for each X = C,D, or Q, let p^M De the 
fraction of nodes in V that share strategy X during stage t, and let 
P(v) [f] be the total payoff of the games between v and all its neighbors 
by the end of stage f. At stage t + 1, for every node v e V, pick 
randomly and uniformly a neighbor, u say, of v as its reference. If 

P(u) [t] > P( v) [r] , then with probability ; sgt 5 ( v ) [f + 

max{d u ,d v \-b 

1] = s(w)[r], where d x is the degree of node x in G, otherwise, then 
keep s(v) unchanged. 

Generally speaking, the evolutionary system may reach a dynam- 
ical equilibrium after sufficiently many stages 16 . However, the inter- 
esting thing is the emergence and stochastic convergence of 
cooperations. As mentioned above, for the classical prisoner's 
dilemma games, cooperation fails to emerge in homogeneous net- 
works 79 . We will see that this is no longer true for our quantum 
prisoner's dilemma games. 

Emergence of Cooperation of Games on 
Homogenous Networks 

We consider two classical models both generating homogeneous 
graphs. The first is the Erdos - Renyi (ER, for short) 17 model, and 
the second is the small world model 18 . In the ER model, for given n 
nodes, and a number p, for every pair of the nodes i and j, with 
probability p, create an edge between i and j. For the small world 
model, we give n nodes in a circle, create an edge between any two 
nodes of distance within a number r, and for each node z, with some 
probability p, change an edge from i to a randomly and uniformly 
picked node k. 

We depict the curves of frequencies of the C-, D- and Q-strategy 
nodes in the evolutionary quantum prisoner's dilemma games on a 
graph from the ER model in Figure 1 (a), and a graph from the small 
world model in Figure 1 (b). 

From Figure 1, we observe that: 1) The frequencies of the three 
strategies evolve with time, and finally reach some stochastically 
stable states; 2) In the graph of the ER model, the D-strategy nodes 
almost vanishes quickly, after which the majority Q-strategy nodes 
and a fraction of C-strategy nodes share the graph; 3) In the graph of 
the small world model, the C-strategy nodes almost vanish quickly, 
after which the majority Q-strategy nodes and a fraction of D-strat- 
egy nodes share the graph; and 4) In both graphs, Q quickly becomes 
the dominating strategy of the evolutionary games, and keeps as the 
dominating strategy once it emerged. This experiment shows the 
emergence of cooperation of evolutionary quantum prisoner's 
dilemma games on homogeneous networks. 

Classical and Quantum Prisoner's Dilemma Games 

We investigate the stochastically convergent strategies of the evolu- 
tionary quantum prisoner's dilemma games on random graphs gen- 
erated by the ER model with different entanglement degrees and 
different expected average degrees. Figures 2(a) and 2(b) give the 
curves of C-, D- and Q-strategies of the games for y = 0 and expected 
average degree d = 4 and 8 respectively. In this case, Q collapses at C 

so that the games are classical ones. Figures 2 (c) and 2 (d) give the 

n 

curves of C-, D- and Q-strategies of the games with y = — , and 

expected average degree d = 4 and 8 respectively. 

By observing Figure 2, we have the following results: (1) For y = 0. 
Then the D-strategy will quickly become the dominating strategy for 

71 

b greater than 1.2 and 1.1 for d = 4 and 8 respectively. (2) For y = — , 

and d = 4. Then the super cooperator Q will become the dominating 

n 

strategy for all b, including b = 2; and (3) For y — — and d = 8. Then 
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Figure 1 | The evolution of three strategies with steps on a graph of the 
ER model (the upper part) and one from the small world model (the lower 
part) respectively. The temptation factor b = 1.18, the expected average 
degree is d = 4 and the is y = n/12 for both the graphs. 

Q will be the dominating strategy for most b's, for which only if b > 
1.8, the D-strategy nodes may surpass the super cooperators. 

By(l),ify = 0, then the games are the classical ones, in which case, 
for non-trivial b's (not too small), D is always the dominating strat- 
egy, so that cooperation fails to emerge. By (2) and (3), if y is non- 
trivial (not too small), then Q will be the dominating strategy, unless 
b is too large. 

This experiment shows that our quantum prisoner's dilemma 
game is remarkably different from the classical PD game, and that 
super cooperation emerges on random graphs, unless y is too small, 
or b is too large. This poses the question: what is the relationship 
between y and b which guarantees the emergence of cooperations of 
the games on random graphs? 

Threshold of Entanglement for Emergence and 
Convergence 

We investigate the relationship between y and b that guarantees 
emergence of super cooperation of the quantum prisoner's dilemma 
games on random graphs of the ER model. 
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(a) d = 4, 7 = 0 



(b) d = 8, 7 = 0 





(c) d = 4, 7 = f 



(d) d = 8, 7 = f 



Figure 2 | The equilibrium fraction of three strategies is plotted as a function of b. In order to compare our results with the classical one, we did the 
experiment when the y = 0 and y = n/7 respectively. 



Given b 



(1, 2) and y e 1 0, —J , suppose that Alice and Bob play 



the game with payoff matrix given in Table 1. 

Vb 

Suppose that y > arccos — — . We consider the following scenario: If 
b 

Alice chooses a Q-strategy, then the maximal payoff of Bob is maxfl, 

Vb 

cos 2 y, b • cos 2 y}. By the assumption of 7 > arccos — — , the best payoff 

b 

of Bob max{ 1 , cos 2 y, b ■ cos 2 y} is minimized at 1 , which occurs only if 

Vb 

Bob chooses Q too. Therefore, if y > arccos — — , then by choosing Q, 

b 

Alice forces Bob to choose Q if Bob wants to maximize his payoff. At 
the same time, when both Alice and Bob choose Q, the payoffs of 
both players and the total game are maximized. By the same reason, 
in this case, (Q, Q) is the unique Nash equilibrium of our quantum 
prisoner's dilemma game. 

Vb 

This result implies that in the case of y > arccos — — , the (super) 

b 

cooperation is guaranteed to emerge in the evolutionary quantum 
prisoner's dilemma games and is guaranteed to dominate almost all 



nodes of the network, for arbitrarily given network, including ran- 
dom graphs. This gives, for the first time, a theoretical threshold for 
emergence and stochastic convergence of quantum prisoner's 
dilemma games on an arbitrarily given graph. 

The result above explores a remarkable role of the Q strategy that 

in the case of y > arccos — — , by choosing Q, a player may force its 
b 

opponent to choose a particular strategy. This is a phenomenon 
frequently occurring in real world games, but never in classical pris- 
oner's dilemma games. 

The result demonstrates that the Q-strategy plays a fundamental 
role in emergence and stochastic convergence of cooperations, that 
our quantum prisoner's dilemma game is remarkably different from 
the classical prisoner's dilemma game, and that our quantum prison- 
er's dilemma game does better reflect the nature of real world games. 

To understand the result, we verify it by experiments. In Figures 3 
(a), (b), (c) and (d), we depict the curves of each of the strategies for a 
random graph with expected average degree d = 8, and with b = 1.2, 
1.5, 1.8and2.1 respectively. In each of the figures, it does showthat,if 
Vb 

y > arccos — — , then the super cooperation strategy Q emerges, stoch- 
b 
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astically converges and dominates the evolutionary games on the 

Vb 

graph. For example, we choose b = 1.5, then arccos— — —35°. 

b 

From Figure 3 (b), we observe that, if y s 37t/16 = 33.75°, then 
the frequency of Q-strategy nodes is almost equal to 1. For b = 

Vb 

2.1, arccos— — =46°, by observing Figure 3 (d), if y s rc/4, then Q 
b 

emerges, stochastically converges and dominates the games on the 
graph. 

On the other hand, by observing Figures 3 (a), (b), (c) and (d), we 
Vb 

have that if y < arccos — — , then the equilibrium frequency of super 
b 

cooperations of the evolutionary quantum prisoner's dilemma games 
increases as the entanglement degree y increases. 

, , Vb 
We thus have that, if y> arccos-—, then super cooperation 

b 

emerges, stochastically converges (almost surely emerges) and dom- 
inates the evolutionary quantum prisoner's dilemma games, and that 

Vb 

if y < arccos —— , then the equilibrium frequency of super coopera- 



tions of the evolutionary quantum prisoner's dilemma games is posi- 
tively correlated to the entanglement degree y's. We emphasize that 
the results here principally hold for games on, not only random, but 
also arbitrarily given networks, although there must be new phenom- 
ena for different structures of networks. 



The Role of Structure of Graphs 

Vb 

We have seen that if y > arccos — — , then super cooperation emerges, 

b 

stochastically converges and dominates the evolutionary quantum 
prisoner's dilemma games on random graphs, and that if 

Vb 

y < arccos — — , then the equilibrium frequency of super cooperation 
b 

is positively correlated to y. 

Vb , 

However, if y < arccos — — , then super cooperation may emerge, or 
b 

may fail to emerge, depending on both the entanglement degree y and 
the structure of the network. What roles do the structures of networks 
play in the emergence of cooperation of the games on the networks? 





(a) b - 1.2 



(b) b = 1.5 





(c) b = 1.8 



(d) b = 2.1 



Figure 3 | The equilibrium fraction of three strategies is plotted as a function of /when b = 1.2, b = 1.5, b = 1.8, b = 2.1 respectively. The size of the 
network is 10000 and the average degree is 8. 
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(a) 7=5 



(b) 7=7 




(c) 7=9 (d) 7=11 

Figure 4 | The equilibrium fraction of super cooperators (the Q-strategy nodes) is plotted as a function of b with different values of expected average 
degree z. 



To understand this, we study the curves of equilibrium frequencies 
of super cooperators of the games on random graphs of the ER model 
with different expected average degrees. 

In Figures 4 (a), (b), (c) and (d), we depict the curves of super 
cooperators of four graphs with expected average degrees z = 4, 8, 16 
and 32, and with entanglement degrees y = n/5, y = nil, y = n/9, y = 
n/ll respectively. 

From the experiments, we observe the following results: (1) If the 
b's are very small, then the equilibrium frequencies of super coopera- 
tions increase as the average degrees of the random graphs increase, 
and that if the b's are appropriately large, then the equilibrium fre- 
quencies of cooperations decrease as the average degrees of the ran- 
dom graphs increase. This shows the role of structures and explores 
an interesting new phenomenon, calling for a new theory to explain. 

Conclusions and Discussion 

We proposed a quantum prisoner's dilemma game and investigated 
the emergence, and stochastic convergence (guaranteed emergence) 
of (super) cooperation of the evolutionary quantum prisoner's 
dilemma games on random graphs. Our results can be summarized 



as follows: 1) if }'> arccos — — , then (super) cooperation always 
b 

emerges and stochastically converges on every network, 2) if 
Vb 

y < arccos — , then equilibrium frequencies of super cooperations 
b 

are positively correlated to the entanglement degree y on every net- 
Vb 

work, 3) if y< arccos — — and b is very small, then the equilibrium 
b 

frequencies of super cooperations on random graphs increase as the 

sfb 

average degrees of the graphs increase, and 4) if 7 < arccos — — and b 

b 

is appropriately large, then the equilibrium frequencies of super 
cooperations on random graphs decrease as the average degrees of 
the graphs increase. The first two results provide a theory for emer- 
gence and stochastic convergence of cooperations of prisoner's 
dilemma games on arbitrarily given networks. The later results 
explore a new phenomenon of emergence of cooperations, predicting 
a new theory for understanding the roles of structures of networks in 
emergence of cooperations. Our results pose a fundamental question 
for future investigation: Given a network G, find the least entangle- 
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ment degree y 0 such that for every y £ y 0 and for every b 6 (1, 2], 
super cooperation of the evolutionary quantum prisoner's dilemma 
games on G is guaranteed to emerge. This could be a general scientific 
problem, calling for solutions from mathematics, physics, infor- 
matics, economics, biology and social sciences etc. Our game is 
derived from quantum theory, but can be played classically. This 
suggests a new direction to characterize the quantum theory that 
can be realized by classic devices. 
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